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Abstract 



We study the problem of type inference for a family of polymorphic type disciplines con- 
taining the power of Core-ML. This family comprises all levels of the stratification of the 
second-order lambda-calculus by "rank" of types. We show that typability is an undecidable 
problem at every rank k > 3 of this stratification. While it was already known that typability 
is decidable at rank < 2, no direct and easy-to-implement algorithm was available. To design 
such an algorithm, we develop a new notion of reduction and show how to use it to reduce the 
problem of typability at rank 2 to the problem of acyclic semi-unification. A by-product of our 
analysis is the publication of a simple solution procedure for acyclic semi-unification. 
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1 Introduction 



Background and Motivation. Modern type systems for functional programming languages 
use polymorphic type inference. Type inference for untyped or partially typed programs saves 
the programmer the work of specifying the type of every identifier. Polymorphism lets the pro- 
grammer write polymorphic functions that work uniformly on arguments of different types and 
avoids the maintenance problem that results from duplicating similar code at different types. 
The first programming language to use polymorphic type inference was the functional language 
ML [GMW79, Mil85]. Due to its usefulness, many of the aspects of ML have been subsequently 
incorporated in other languages (e.g. Miranda [Tur85]). ML shares with Algol 68 properties of 
compile-time type checking, strong typing and higher-order functions while also providing type 
inference and polymorphism. 

The usefulness of a particular polymorphic type system depends very much on how feasible the 
tasks of type checking and type inference are. We define these concepts in terms of the untyped 
A-calculus, which we use as our pure functional programming language throughout this paper. By 
type checking we mean the problem of deciding, given a A-term M and a type r, whether r is one 
of the types that may be derived for M by the type system under consideration. By type inference 
we mean the problem of finding a type derivable for a A-term in the type system. The problem of 
type inference involves several issues: 

(1) Is typability decidable, i.e. is it decidable whether any type at all is derivable for a 
A-term in the type system? 

If typability is undecidable, then there is little more to say in relation to type inference. (Although 
a programming language may work around this problem by asking the programmer to supply 
part of the type information and by using heuristics, we will omit discussion of this possibility.) 
Otherwise, if typability is decidable, then it is possible to construct a type for typable A-terms, i.e. 
type inference can be performed, in which case we further ask: 

(2) How efficiently can typability be decided? How efficiently can type inference be 
performed? 

(3) Can a principal type (a "most general" type) be constructed for typable A-terms? 
The answers to these questions determine how feasible the type system is to implement. 

In addition to the feasibility of a particular polymorphic type system, its usefulness also depends 
on how much flexibility the type system gives the programmer. Although the polymorphism of ML 
is useful, it is too weak to assign types to some program phrases that are natural for programmers 
to write. To overcome these limitations researchers have investigated the feasibility of type systems 
whose typing power is a superset of that of ML. Over the years, this line of research has dealt 
with various polymorphic type systems for functional languages and A-calculi, in particular the 
powerful type system of the Girard/ Reynolds second-order A-calculus [Gir72, Rey74], which we 
will call by its other name, System F. In the long quest to settle the type checking and typability 
problems for F, researchers have also considered the problem for F modified by various restrictions. 
Multiple stratifications of F have been proposed, e.g. by depth of bound type variable from binding 
quantifier in [GRDR91] and by limiting the number of generations of instantiation of quantifiers 
themselves introduced by instantiation in [Lei91]. One natural restriction which we consider in 
this paper results from stratifying F according to the "rank" of types allowed in the typing of 
A-terms and restricting the rank to various finite values (introduced in [Lei83] and further studied 
in [McC84, KT92]). All of these systems improve on the expressive power of ML. 

Unfortunately, it is often the case that the more flexible and powerful a particular polymorphic 
type system is, the more likely that it will be infeasible to implement. As discouraging examples, 
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the problems of typability and type checking for many of the polymorphic type systems mentioned 
above have recently been proven undecidable. Type checking and typability were shown to be 
undecidable for System F (cf. recent results submitted for publication elsewhere in [Wel93]) and 
for its very powerful extension, System F u [Urz93]. Other related systems that are not exactly 
extensions of ML have also recently been proven to have undecidable typability, i.e. System F< 
which relates to object-oriented languages [Pie92], and the All-calculus which relates to extensions 
of A-Prolog [Dow93]. 

Against this recent background, it is desirable to demarcate precisely where the boundary 
between decidable and undecidable typability lies within various stratifications of System F. In the 
case of decidable typability, it is also desirable to develop simple and easy-to-implement algorithms 
for the most powerful level within a stratification that is also feasible to use. We undertake this 
task for the stratification of System F by rank of types. 

Contributions of This Paper. We can now firmly establish the boundary for decidability of 
typability and type checking within the stratification of System F by rank of types. The two 
problems are undecidable for every fragment of F of rank > 3 and are decidable for rank < 2. 
The undecidability of type checking at rank > 3 can be seen by observing that the proof for the 
undecidability of type checking in F in [Wel93] requires only rank-3 types. 1 The undecidability 
of typability at rank > 3 results from the fact that the constants c and / defined in section 5 
of [KT92] can be encoded using the methods of [Wel93] in System A 3 (the rank-3 fragment of F) 
and from Theorem 30 of [KT92]. We give this encoding in this paper. Since it was already known 
from [KT92] that typability is decidable for System A 2 (the rank-2 fragment of F), we know exactly 
where the boundary of decidability for typability lies. These circumstances lead us to look for a 
simple and direct algorithm for type inference within A 2 . 

The existing proof that typability is decidable for System A 2 uses a succession of several re- 
ductions to the typability problem in ML and results in a type inference algorithm that is neither 
simple nor easy to understand. In this paper, we give a simpler and more direct algorithm for 
the decidable case of typability in A 2 . We first prove that A 2 is equivalent to a restriction named 
System An - '* having many convenient properties. We then develop a notion of reduction named 
9 which converts A-terms into a form which is more amenable to type inference but which also 
preserves every A-term's set of derivable types in An - '*. The type inference problem in A^'* for a 
A-term in ^-normal form is easily converted into an acyclic semi-unification problem. Finally, we 
give a simple algorithm for solving acyclic semi-unification problem. The complexity of the whole 
procedure is the same as that of type inference in ML. 

We omit all proofs of all lemmas and theorems in this conference report to remain within the page 
limit. A later extended version of this paper will clarify the relationship between ML-typability 
and typability in A 2 and discuss the issues of type checking and principal types in A 2 . 

Acknowledgements. A number of definitions used in this paper were lifted from [KT92, KTU90, 
KTU93]. 

2 System and System 

In this section, we define first the untyped A-calculus, then System F, then the restriction of System 
F that results in System A k . Then, we define a restriction of System A 2 called System A^ which 
has equivalent typing power. We use the "Curry view" of type systems for the A-calculus, in which 

1 In second version of report to appear mid-December 1993. 
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pure terms of the A-calculus are assigned types, rather than the "Church view" where terms and 
types are defined simultaneously to produce typed terms. 

The set of all A-terms A is built from the set of A-term variables V using application and 
abstraction as specified by the usual grammar A ::= V | (A A) | (AV.A). We use small Roman 
letters towards the end of the alphabet as metavariables ranging over V and capital Roman letters 
as metavariables ranging over A. When writing A-terms, application associates to the left so that 
MNP = (MN)P. The scope of "As." extends as far to the right as possible, while the scope of 
"As" without the "." covers as little as possible. 

As usual, FV(M) and BV(M) denote the free and bound variables of a A-term M. By M[x:=N] 
we mean the result of substituting N for all free occurrences of x, renaming bound variables in 
M to avoid capturing free variables of N. We will sometimes use this substitution notation on 
subterms when we intend free variables to be captured; we will distinguish this intention by the 
proper use of parentheses, e.g. in Xx.(N[y:=x]) we intend for the substituted occurrences of x to 
be captured by the binding. A context C[-] is a A-term with a hole and if M is a A-term then 
C[M] denotes the result of inserting M into the hole in C[-], including the capture of free variables 
in M by the bound variables of C[-]. We denote that N is a subterm of M (possibly M itself) 
by N C M. We assume at all times that every A-term M obeys the restriction that no variable is 
bound more than once and no variable occurs both bound and free in M. K denotes the standard 
combinator (Xx.Xy.x). 

The set of all types T is built from the set of type variables V using two type constructors 
specified by the grammar T ::= V | (T— >T) | (VV.T). We use small Greek letters from the 
beginning of the alphabet (e.g. a and fi) as metavariables over V and small Greek letters towards 
the end of the alphabet (e.g. a and r) as metavariables over T. When writing types, the arrows 
associate to the right so that o—*T—*p = a—^(T—^p). We use the same scoping convention for 
"V" as we do for "A". FTV(r) and BTV(r) denote the free and bound type variables of type r, 
respectively. We give the notation <7[a:=r] the same meaning for types that it has for A-terms. We 
write a < t to indicate that a can be instantiated to r, i.e. a = \fa.p and there exist types x such 
that p[a:=x] = t. ^° denotes that the types x in the substitution contain no quantifiers. We write 
_L to denote the type Va.a. 

We have several conventions about how quantifiers in types are treated, a-conversion of types 
and reordering of adjacent quantifiers is allowed at any time. For example, we consider the types 
Va.V/3.a fi, MfiNa.fi a, and V/3.Va.a —^fito all be equal. Using a-conversion we assume that 
no variable is bound more than once in any type, that the bound type variables of any two type 
instances are disjoint, and that all bound type variables of any type instance are disjoint from the 
free type variables of another type instance. If a = Va.r and a ^ FTV(r), we say that "Va" is a 
redundant quantifier. We assume types do not contain redundant quantifiers. 

We define a notation for specifying many quantifiers concisely. For type a and set of type 
variables X C FTV(ct), the shorthand notation MX. a is defined so that V0.<7 = a and V(X U 
{a}).<7 = Va.V(X — {of}). a. This defines just one type because we assume the order of quantifiers 
does not distinguish two types. We may use a to stand for a sequence of type variables a l5 . . ., 
a n . We allow a to be treated as a set or as a comma-separated sequence as is most convenient, so 
Va.CT has the expected meaning. The notation V.ct means V(FTV(<t)).<7. 

To define System A k , we will use the following inductive stratification of types. First define 
i?(0) as the set of open types, i.e. types not mentioning V. Then, for all k > 0, define R(k + 1) by 
the grammar R(k + 1) ::= R(k) \ (R(k)^R(k+ 1) ) | ( V V. R{k + 1)). We say that R(k) is the 
set of types of rank k. For example, Ma.a—^Mfi.a—^fi is a type within rank 1 and (Va.a— >-a)— >-V 'fi.fi 
is a type within rank 2 but not within rank 1. Our definition of rank is equivalent to the notion of 
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A\- x : a 



A(x) = a 



APP 
ABS 
INST 



A h M : a-rT, A\- N : a 
A h (M N) : t 

A U {x:a} h M : r 
A h (Aa?.M) : d^r 

AhM: Ma.a 



A h M : o-[a:=r] 

ihl:(7 
AhM: Va.a 

Figure 1: Inference Rules of System F and A k . 



GEN l > ' l) l/ : / T . " * 1 TVi.U 



rank introduced in [Lei83]. Since R(k) C R(k + 1) it follows that if a type <7 is within rank k, then 
it is within every rank n > k. Observe that performing the substitution <7[a:="r] may not preserve 
rank. The resulting rank depends on the rank of r and how deep in the negative scope of — ► the 
free occurrences of a in a are. 

To define AJf, we will use subsets of the type sets i?(0), -R(l), and i?(2) called 5(0), 5(1), and 
5(2). Let 5(0) = R(0) be the set of all open types. Let 5(1) be the set of all types of the form 
Va.cT, where a £ S(0). Let S(2) be the set of all types of the form Va.Ci— ► • • • —^a m —^T , where 
d!, . . . , <7 m G 5(1) and r £ 5(0). 

An assertion is an expression of the form A\- M : r where A is a type assignment (a finite set 
{xi : d!, . . . , x n : <7 n } associating at most one type a with each variable x), M a A-term and r a 
type. We say this assertion's type is the type Ci— ► • • -^<7 n ^r and an assertion's ranfc is the rank 
of its type. An assertion A h M : r is within rank 2 if and only if r is within rank 2 and all the 
types assigned by A are within rank 1. A(x) denotes the unique type a such that that (x:a) £ A. 
FTV(A) is the set of all free type variables in all of the types assigned by A. The notation A[a:=x] 
denotes a new type assignment A' such that if A(x) = a then B(x) = a[a: = ~x\. We assume that 
throughout an assertion it is the case that all bound type variables are named distinctly from each 
other and that the bound and free type variables do not overlap (satisfied by a-conversion). 

We define System F to be the type system that can derive types for A-terms using the inference 
rules presented in Figure 1 with no other restrictions. For every k > 0, we define A k as the 
restriction of F which allows only assertions within rank < k to be derived. We define System 
Ajf as a restriction of System A 2 where the two differences are that (i) in Ajf all assertions must 



, NST - r € S(0) 

A h M : a[a:=T] V ; 



Figure 2: INST : Replacement for INST in A 2 . 
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have types in S(2) (thus all assigned types are in S(l) and all derived types in S(2)) and (ii) that 
the inference rule INST of A 2 is replaced by the rule INST - described in Figure 2. Theorem 9 
in [KT92] shows that A 2 types the same set of terms as A 2 with very similar types. Since A 2 is 
as powerful as A 2 and since its restrictions make analysis of type inference easier, we will use it 
instead of A 2 in this paper. 

If K, is a type inference system, then the notation A h K M : r denotes the claim that A\- M : r 
is derivable in /C. 



3 System Typability Undecidable for k > 3 

Section 5 of [KT92] introduces System A^fC^] for each k > 3 and Theorem 30 of the same pa- 
per proves that typability is undecidable for A^fC^] for k > 3. The original definition of A^fC^] 
defined it based on A k by adding two constants, c and /, with predefined types <f) Cik and 4>j ik . 
A simple alternate definition is to declare that A h M : r is derivable in A^fC^] if and only if 
A U {c:cj) Cik , f:cj)f ik } h M : r is derivable in A k . 

The analysis for A 3 [C 3 ] goes as foUows. For A 3 [C 3 ] the types of the constants c and / are 
^> c3 = V ' a.a—^(((a—^a)—^a)—^a) and ^ 3 = \la.(a—*et)—^((a—^a)—^a). We construct a context 
C 3 [ • ] with one hole: 



J,[- 


= (Xyi.iXzi.riyiyiiyiZijjj^X 


Xi.Kxi(K(xi(xir))[ ■ ]))(Xwi.WiWi) 


D[- 


= (Xf.r(x 1 (fx 1 x 1 ))(x 2 (fx 2 x 


2 ))[-])(Xu.Xv.u(v(u(ur)))) 


E[- 


] = (Ai.r(a; 1 (te 1 (a; 1 r)(/a; 1 )))(: 


c 2 (tx 2 (x 2 r)(fx 2 )))[ ■])(Xp.Xq.Xs.K(p(pq))(p(sp))) 


GI- 


= (Xc.r(x 1 (c(x 1 r)(fx 1 )))(x 2 


{c(x 2 r)(jx 2 )))[.])(tr) 


GA- 


] = Xr..h[.h[D[E[G[ •]]]]] 





Using the methods of [Wel93], it can be seen that this context can be typed in A 3 and in any typing 
of this context (with any A-term placed in the hole), the variables c and / must be assigned the 
types 4> c>3 and (^ /)3 . 

Since for each k > 3 a context C^[-] having the same properties with respect to A k can be 
constructed, we have this result: 

Theorem 3.1 For any type assignment A, there exists a type t such that AU{c:cj) Cik , /:^ fc }h A3 M : 
t is derivable if and only if there exists a type t' such that A\- A3 C k [M] : r' is derivable. Thus, the 
problem of typability for A k [C k ] for k > 3 is reducible to the problem for A k . Therefore, typability 
is undecidable for A k for every k > 3. 

4 System A^~'* 

In this section, we observe a number of convenient properties of System A 2 . We then define System 
AJf'* as a restriction of AJf that embodies these properties and prove that AJf'* is equivalent to A 2 . 

Definition 4.1 (act) (Taken from [KT92].) Let us define, by induction on A-terms M, the se- 
quence act(M), of active variables in M: 

1. act(x) = e (the empty sequence) 

2. act(Xx.M) = x ■ act(M) 

q ,/„. An / £ if act(M) = s 

$ - WWM * ,= if ««*> = »,...*.,*. rcm.,*! 
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Let us observe that due to our conventions, there are no repetitions of variables in act(M). 
The sequence act(M) represents outstanding abstractions in M , i.e. those abstractions which have 
not been "captured" by an application. For each application subterm Q = RS in a A-term M 
where act(R) = x ■ ■ ■ , there is an abstraction subterm N = (Xx.P) within R (possibly R itself). In 
this case, we say that the subterms N , Q, and S are companions, specifically, N is the companion 
abstraction, Q the companion application, and S the companion argument of the others. In this 
case, if N is ever /3-reduced, its argument will be S or 5"s /3-descendent. If N = R, i.e. Q = N S , 
then we say that they are adjacent companions and it is the case that they are a /3-redex. A 
set of non-adjacent companions represents a "potential" /3-redex in a A-term whose presence can 
be detected by simple inspection without /3-reduction. Companions turn out to have convenient 
properties in AJf. 

Definition 4.2 (( ) A ) For a A-term M, we define (M) A as the effect of traversing M and labeling 
each of its abstraction subterms with an index i £ {1,2,3}, depending on the subterm's position 
and whether it has companions. (M) A is defined in terms of an auxiliary function label which takes 
as parameters a A-term, a set of variables, and an index. The inductive definition of label follows 
for i e {1,2,3}: 

1. label(x,X, i) = x 

s [(Xx.labeUM.XA)) ifxeX, 

2. label((Xx.M),X,i) = \ K I > > ^ 

U ; ; \(X 1 x.label(M,X,i)) if x £ X 

3. label((MN),X, i) = (label(M, X, i) ■ label(N, act(N),3)) 

We then finish the definition by saying that (M) A = label(M, act(M),2). 

Informally, labeling the A-term M affects each abstraction subterm N as follows. If N has 
companions, then it is labelled with A 1 . If TV does not have companions, then it is labelled with A 2 
if there is no subterm P = LR of M such that N lies within R, the right subterm. Otherwise N is 
labelled with A 3 . When dealing with a labelled A-term M after this point, we will assume that the 
labeling is the result of the ( ) A operator and not any arbitrary labeling, i.e. we assume that either 
M = (iV) A or M C (N) x for some unlabeled A-term N. 

Lemma 4.3 If V is a derivation in AJf that types the X-term M , and there is an abstraction 
subterm (Xx.N) in M , and there is a subterm (PQ) in M such that x appears in act(Q), and there 
is an assertion A U {x:a} h N : r in V, then a £ S(0). Restated more informally, the bound 
variable of a companionless, X s -labelled abstraction must be assigned a monomorphic type. 

Lemma 4.4 If in AJf there is a derivation V ending with the assertion A h M : t, then for 
any type variable substitution [a:=x], it is the case that there is a derivation V ending with the 
assertion A[a:=x] l~ M : r[a:=x] and, furthermore, V and V are of the same length and there is 
a one-to-one correspondence between rule applications in both derivations. 

Lemma 4.4 is used by Lemma 4.5. For Lemma 4.5, let us temporarily suppose that quantifiers 
introduced into types by the GEN rule are marked. For example, from the assertion A h M : r 
where a £ FTX(A) we can derive using GEN the assertion A h M : V^a.r. These markers on 
quantifiers do not affect the behavior of the inference rules; they merely allow us to precisely 
phrase the lemma. 

Lemma 4.5 If in AJf there is a derivation V ending with the assertion A\- M : t, then there is 
a derivation V ending with the same assertion such that there is no use of the INST rule whose 
premise is an assertion of the form B h N : a. p. In plainer English, we can assume that 
quantifiers introduced by GEN are never instantiated. 
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VAR* A^x:Va.T A(x) <° r, r G 5(0), a FTV(A) 

APP* AhM A^MN)^L N '' a nT Si,,! - • «^ FTV ^) 

APP-+ Ah ^; g (MW)=Vg!"/ :g TG5(0) ' ^ M )^ £ ' «*FTV(A) 

ABS *' 3 4 ^ U J"^ h w M:T ^£5(0), 5*FTV(A) 

ih A 3 i.M :Va.(T^r ' w ' * v ; 



Figure 3: Inference Rules of System A 



Lemma 4.6 If V is a derivation in AJT that types the X-term M , and V includes the assertion 
A h N : Va.r, (roc? i/iere are no subsequent assertions in V for the subterm N that are derived from 
this assertion, then either N = M or there is a subterm (PN) in M where act(P) ^ e. Rephrased, 
the only proper subterms for which the final derived type may be a \f-type are companion arguments. 

Lemma 4.7 results from Lemmas 4.5 and 4.6. 

Lemma 4.7 If V is a derivation in AJT that types the X-term M , and V includes the assertion 
A h N : Va.r as a consequence of the GEN rule, then N is a companion argument. 

Lemma 4.8 If in AJT there is a derivation V ending with the assertion A\- M : t, then there is 
a derivation V ending with the same assertion such that if the assertion B h N : a in V is the 
consequence of the INST rule, then N £ V, i.e. N is a variable. In other words, we can assume all 
uses of the INST rule occur at the leaves of the derivation (viewing the derivation as a tree). 

We now define the new System AJT'* to formally include the restrictions proven by the previous 
lemmas into a type system. We present the inference rules for AJT'* in Figure 3. As in AJT, all 
assertions are required to be within rank 2. 

Theorem 4.9 Ah ( - A -- ) M : r holds if and only if Ah ( - A -,.- ) (M) A : r holds, i.e. every AJT typing is 
equivalent to a AJT'* typing and vice versa. 

5 ^-Reduction and System A^'*' 0 

In this section, we define a new notion of reduction and then use it to reduce System A J'* typability 
to an even more restricted type discipline, System AJT'*' . 

Definition 5.1 (9) We define 4 notions of reduction denoted # l5 9 2 , 0 3 , and 9 A which will transform 
a labelled A-term (M) A in a useful way. These transformations are defined as follows: 

• 9i transforms a subterm of the form (((X 1 x.N)P)Q) to ((A 1 a;.AQ)P). 
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• 0 2 transforms a subterm (X 3 x.(X 1 y.N)P) to ((X 1 v .X s x .(N[y:=vx]))(X 3 w .(P[x:=w]))) , where 
v and w are fresh variables. 

• 0 3 transforms a subterm of the form (^((A^.P)^)) to ((\ 1 x.N P)Q). 

• 0 A transforms a subterm of the form ((A 1 a;.(A 2 y.iV))P) to (A 2 y.((A 1 a;.iV)P)). 

Capture of free variables in # l5 0 3 , and 0 A does not occur due to our assumption that all bound 
variables are named distinctly from all free variables. # l5 0 3 , and 0 A affect subterms that are 
applications, while 0 2 is applied to subterms that are abstractions. When A-terms are viewed as 
trees, # l5 0 2 , and 0 3 can be seen to have the effect of hoisting /3-redexes higher in the transformed 
term, while 0 A has the effect of raising an abstraction above a /3-redex. In section 6, we will 
use properties of these transformations to prove that a typability problem is reducible to acyclic 
semi-unification. 

We use the notation 0, where i £ {1,2,3,4} to stand for one of # l5 0 2 , 03, or 0 A . We define 0ij 
to be 0i U 0j and define 0 = #1,2,3,4- Since these transformations are all notions of reduction, —^e 1 , 
—^ ei 2 , — etc., have the expected meaning. 

We say that a term is in 0 -normal form if it has no #-redexes. A ^-normal form of M is a 
A-term N in ^-normal form such that M —»g N . A A-term may have more than one ^-normal 
form, e.g. the A-term (((Xx.M)N)((Xy.P)Q)) has two ^-normal forms, ((Xx.(Xy.MP)Q)N) and 
((Xy.(Xx.MP)N)Q). 

We now prove a variety of useful properties of ^-reduction. 

Lemma 5.2 Let M be in 0 -normal form. M is of the form: 

X 2 x 1 .X 2 x 2 X 2 x m .(X 1 y 1 .(X 1 y 2 .(. . . ((A 1 y n .T n+ i)T n ) . . .))T 2 )T 1 

where m > 0, n > 0, and where T 1; P n +i are X-terms in fi -normal form. Furthermore, any 
abtractions within Ti for 1 < i < n + 1 are X s -labelled. Thus, all X 1 -labelled abstractions belong to 
(3-redexes, i.e. there are no non-adjacent companions. 

The A-term M detailed in Lemma 5.2 can also be viewed as the following ML term: 

fn Xi =>■ fn x 2 =>■ . . . =>■ fn x m =>■ let y x = Pi in let y 2 = T 2 in ... let y n = T n in S 

Lemma 5.3 0 1; 0 2 , 03, and 0 A always transform a X-term M into a f) '-equivalent X-term N , i.e. if 
M N, then M = p N. 

Lemma 5.4 6 -reduction always terminates, i.e. it is strongly normalizing. 

Lemma 5.5 We can assume that the type assigned to the bound variable of a X 1 -abstraction which 
is the function of a (3-redex has no free type variables that are not also free somewhere else in the 
type assignment. 

Lemma 5.5 is used by Lemma 5.6. 

Lemma 5.6 If 0\, 0 2 , 03, or 0 A transform M into N in one step, then with any particular type 
assignment, both M and N are typable with the same types in A 2 '* . In other words, if M N , 
then in K 2 '* it holds that A\- M : 7r is derivable if and only if A\- N : 7r is derivable. As a result, 
Ah ( - A -,.- ) M : t is true if and only if A\-^ A -,,^0-nf(M) : r is true. 

Lemma 5.7 act(0-nf((M) x j) = act(M). 
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VAR" A h x : t A(x) <° r, r G 5(0) 

AhM: <T->r, A h iV : ct ^ , , , 

APP AHMN):t (1 - 7 Si,,! - 

LET * AU{s:V. g }hM:r, A h A : cx 

ABS M AU{s:V. g }hM:r 



ABS 



,2 



A U {x:±} h M : r 
A h (A 2 a:.M) : _L^r 



A h (A 3 a;.M) : <j->t ' v ; 



Figure 4: Inference Rules of System A 



Lemma 5.8 (From [KT92].) In A 2 '*, if A h M : p is derivable and \act(M)\ = n, then p = 
Ma.Gi—^ . . .—fa n —fT and a G S(l) and t G 5(0). 

Lemma 5.9 We can always assign the type _L = \fa.a to the bound variable of a companionless, 
A 2 -labelled abstraction without affecting the whole X-term's typability. 

Lemma 5.10 Under the restriction that the outermost type assignment assigns the type _L to all 
variables, we can always assign universally closed types to the bound variables of every X 1 -labelled 
abstraction without affecting the whole X-term's typability. 

We now define System h^'*' e to take advantage of the typing properties of A-terms in ^-normal 
form in AJf'*. System AJf'*' is intended to be used only for ^-normal forms; its behavior on other 
A-terms has not been investigated. The inference rules for AJf'*' are presented in Figure 4. As with 
AJf'*, assigned types must be in S{1) and derived types must be in S(2). 

Theorem 5.11 Typability and type inference in AJf'* are reducible to the same problems in AJf'*' e . 
For a labelled X-term M where \act(M)\ = n, if Ah ( - A -,.- ) M : Ma.ai—^ ■ ■ ■ —*o n —*T holds, then using 
the type assignment B that maps all variables to type _L it is the case that B\-^ A -,*,e^9-nf(M) : 
J.— ► • • - —^L—^t holds as well. If C\-^ A -,*,s^9-nf(M) : p holds, then Ch^ A -,.^M : p must hold as well. 

6 System A^'*' 0 Type Inference Reducible to ASUP 

In this section we define the problem of acyclic semi-unification, give an algorithm for solving 
it, and develop a construction for reducing the problem of typability in System AJf'*' to acyclic 
semi-unification. 

For convenience, we define semi-unification using the set of open types R(0) as the set of 
algebraic terms T. Let X = V denote the set of term variables to emphasize their use in algebraic 
terms as opposed to types. Although the members of T are also types, we will refer to them as 
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terms when using them in semi-unification. A substitution is a function S : X — ► T that differs 
from the identity on only finitely many variables. Every substitution extends in a natural way to 
a -^-homomorphism S : T — ► T so that ^c— >-t) = 5 , (o')^5 , (r). An instance T of semi-unification 
is a finite set of pairs (called inequalities) in T X T. Each such pair is written as r < fj, where 
r, /i G T. A substitution 5 is a solution of instance T = {ti < /ii, . . . , r n < //„} if and only if there 
exist substitutions . . . ,R n such that: 

RiiSin)) = Sifii) , . . • , R n (S(T n )) = S(ji n ) 

For an arbitrary term r, we define the left and subterms of r, denoted L(t) and R(t). More 
precisely, if r is a variable then L(t) and -R(r) are undefined, otherwise we set i(r 1 ^r 2 ) = r 1 and 
R(t 1 —^t' 2 ) = t 2 . If II G {L, R}* , say II = • • • x p , the notation II(r) means Xi(x 2 (- ■ ■ (x p (t) •••))• 
For an arbitrary II G {L,R}*, the subterm II(r) is defined provided II (read from right to left) is 
a path (from the root to an internal node or to a leaf node) in the binary tree representation of r. 

An instance T of semi-unification is acyclic if it can be organized as follows. There are n + 1 
disjoint sets of variables, V 0 , . . . , V n , for some n > 1, such that the inequalities of T can be placed 
into n columns: 



r l,2 < ^1,2 r 2,2 < ^2,2 r n,2 < ^ 



r l,r 1 < ^l.n r 2,r 2 < ^2,r 2 r n,r„ < ^n,r„ 

where: 

V 0 = FVir 1 ' 1 ) U ••• U Fy(r 1,ri ) 

Vi = FViji 1 ' 1 ) U ••• U J FF(// 1 ' ri ) UFV(t 2 ' v ) U ••• U ^(r 2 ' 1 " 2 ) 

T/.! = FViji"- 1 ' 1 ) U ••• U J FF(^ n - 1 ' r "- 1 ) U ^(r"' 1 ) U ••• U Fy(r"' r ") 
K = Fyf/i"' 1 ) U ••• U FV{p n ' r *>) 

The acyclic semi-unification problem (henceforth abbreviated ASUP) is the problem of deciding, 
for an ASUP instance T, whether T has a solution. 

We now define a procedure which constructs a solution for ASUP instance T if T has a solution 
and otherwise answers that there is no solution. This procedure is a modification of the procedure 
defined in [KTU93] which consists of repeatedly reducing redexes, which can be of two kinds, and it 
halts when there are no more redexes or when a conflict is detected that precludes a solution. Each 
reduction substitutes a term for a variable throughout T and the composition of the reductions 
done so far represents the construction of the solution. 

• (Redex I reduction) Let £ G X and let r' (j£ X be a term with the property that there is a 
path II G {L, R}* and r < fj, is an inequality of T such that: 

n(r) = t' and II(/z) = £ 

The pair of terms (£,T(t')) where T is a one-to-one substitution that maps all variables in 
t' to fresh names is called a rerfea; 2. Reducing this redex substitutes F(t') for all occurrences 
of £ throughout T. 
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• (Redex II reduction) Let £ G X and // G T have the property that £ ^ fj,' and there are paths 
II, A, X G {£, P}* and r < fj, is an inequality in T such that: 

n(r) = A(r) G X and EII(/x) = £ and EA(/z) = fi' 

Such a pair (£, //) is called a rerfea; 27. Reducing this redex consists of substituting // for all 
occurrences of £ throughout T. However, if there is a path 0 G {L,R}* such that 0(/i') = £, 
then no solution to T is possible, so the procedure halts and outputs the answer that there is 
no solution if this is detected. 

Although the general case of semi-unification was proven to be undecidable in [KTU93], we 
have the following result for ASUP: 

Lemma 6.1 For an instance T of ASUP, the redex procedure either constructs a solution S to T 
and halts or correctly answers that T has no solution and halts. 

To solve the typability and type inference problems for K^'*' 0 for A-terms in ^-normal form, we 
construct an ASUP instance T. Consider the labelled A-term M in ^-normal form: 

M = X 2 Xl .X 2 x 2 . . . .A 2 a> m .(AV(AV(- . .((AVr„ +1 )r„) . ..))T 2 )T 1 

We will adopt the convention that the abstractions in the component T; bind variables named z,^, 
Zi y2 , etc. By writing the inequality (r < 8 - /i), we assert that the inequality will belong to column i 
of r. Most of the inequalities will be of a certain special form, so (r =, /i) denotes the inequality 
(a—^a <i t—^h) where a is a fresh variable mentioned in no other term in T. This will have the 
effect of unifying r and fj, as in ordinary first-order unification. We will assume that the subterms 
of M are indexed so that two otherwise identical subterms in different positions within M will be 
considered distinct in what follows. 

We construct T as follows. In constructing the instance T of ASUP, each subterm N C Ti will 
contribute one inequality, each /3-redex ((A^-.P;)!;) will contribute one inequality, and for each 
variable there will be 1 + n — i inequalities. For each subterm N of Tj, the term variable S N 
will represent the derived type of N . For each bound variable z it j (which must be monomorphic), 
the term variable 7^ will represent its assigned type. For each bound variable (which must 
be universally polymorphic), the term variables /3j j, . . ., /3 n 8 ' will represent its assigned type. For 
each occurrence of Xj (which will be assigned the type _l_), there will be no particular variable to 
represent its type, since it is unconstrained. 

Now we define the inequalities that will be in T. For each subterm N of Tj, we add an inequality 
to r that will depend on N: 

1 . For N = Xj, we do not add any inequality. 

2. For N = yj, we add (A-ij <i <*>jv). 

3. For N = Zij, we add (7^ =, S N ). 

4. For N = (PQ), we add (S P =,- 6 Q ->6 N ). 

5. For N = (X 3 Zij .P), we add {"jij—^ip =i S N ). 

For each /3-redex ((A^-.P;)!;), we add the inequality (/^ =, St,)- Finally, for each bound variable 
yj and for each i G {j ' + 1, . . . , n + 1}, we add the inequality (A-ij < 

Theorem 6.2 The ASUP instance T has a solution S if and only if the X-term M in 9-normal 
form is typable in AJf'*' . Furthermore, if M is typable in AJf'*' , the type _L— ► • • •— ►(,5'(# T )) 
where the number of "L" components of the type is m (the number of variables in the sequence x 1; 
. . . , x m , also the value of \act(M)\) is a type derivable for M in AJf'*' . 
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We can finally descripe our type inference algorithm for System A 2 . If M is typable in A 2 , then 
the following procedure will produce a type for it and will otherwise answer that M is not typable: 

1. Compute the labelled M 1 = (M) A . 

2. Compute the A-term M 2 = #-nf(Mi) using ^-reduction. 

3. Compute the ASUP instance T. 

4. Run the redex procedure on T to either produce a solution S for T or the answer that T has 
no solution. In the latter case, halt with the answer that M is not typable in A 2 . 

5. Compute and output the type _L— ► • • • — ( 5 where the number of "_L" components 
is \act(M)\. 

It was shown in [KT92] that A 2 typability is DEXPTIME-complete (where DEXPTIME means 
DTIME(2 n 1 ')). We have just developed an algorithm that reduces A 2 type inference to ASUP 
in polynomial-time. ASUP was shown to be DEXPTIME-complete in [KTU90]. Therefore, our 
algorithm is optimal. 
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